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The sequencing of the genomes of sev- 
eral organisms, including human, is pro- 
ceeding at a dramatic pace. However, 
the function of the protein products of 
many of the newly identified genes is 
unknown. Cells typically express several 
thousand different proteins, and each of 
these may be subject to a wide variety 
of dynamic posttransfational modifi- 
cations, effectively increasing the num- 
ber of different proteins present. Al- 
though not yet being undertaken on the 
same scale as the DNA- sequencing pro- 
grammes, a number of pro jects aimed 
at characterizing all the PROTeins ex- 
pressed by the genOME of an organism 
- the 'proteome' - are now under way. 
Mere, the authors discuss some impor- 
tant recent devekxxnents in the compre- 
hensive analysis of protein expression 
* proteome analysis - which together 
have put this approach at a watershed. 
Current progress suggests that the data 
it will generate may alter fundamen- 
tally bur understanding of the molecu- 
lar composition arid function of cells. 

There Is widespread awareness of the 
genome- sequencing protects that have 
been completed or are currently under 
way (Table 1). The recent announce- 
ment of the complete sequence of an 
archaeon (Methanococcus jannaschh~) } 
means that comprehensive genome- 
sequence comparisons of bacteria, 
archaea and eukarya are now possible. 
The most ambitious of the as-yet- 
incomplete sequencing projects, the 
human genome project, was Initiated 
in the USA in 1 990 and now Involves 
laboratories in the UK, France, Germany 
and fapan. It is estimated that it will 
cost in excess of S3 billion over a 
period of 1 5 years. Initially, genetic and 
physical mapping was a priority, but 
now the second phase - sequencing - 
has begun and has an anticipated 
completion date of a point during the 
first decade of the next millennium 2 . 
For obvious reasons, the mapping and 
sequencing of the human genome 
attracts considerable interest not least 
from commercial organizations, which 
see enormous (financial) potential 
in the information that the project 
will generate. 



The progress of existing sequencing 
projects has been dramatic: they are 
producing data for new genes at an 
exciting and ever-increasing rate. 
Exciting because, for example, the 
recently accomplished complete se- 
quence (-12.5 Mb) of the eukaryotic 
budding yeast Sacchoromyces cere- 
visiae has revealed that approximately 
half of the genes (2964) are previously 
unidentified, and of these about 2300 
show no significant sequence hom- 
ology to known genes and hence are 
of completely unknown function 1 . 
Moreover, the function of many of the 
new genes that show some sequence 
homology to known genes remains 
elusive. It has become apparent, there- 
fore, that this huge and rapidly grow- 
ing repository of information rep- 
resents a starting point for the more 
challenging aim of understanding 
completely how a simple eukaryotic 
cell such as 5. cerevt'sfoe functions 3 - 4 . 
This will undoubtedly be superseded 
by the quest for a similarly detailed 
understanding of how higher eukary- 
otic cells and a complex organism, the 
human, function. Attention will turn 
increasingly to the protein products 
of the genes. 

Although less publicized than their 
genome counterparts, a number of 
proteome projects have been started, 
and in some cases significant progress 
has been made (see Table 1 ). In these 
projects, complex mixtures of proteins 
are reserved by two-dimensional (2-D) 
gel electrophoresis, and individual 
proteins characterized by a growing 
array of methods. Recent develop- 
ments have made important improve- 
ments in both the speed and sensitiv- 
ity, with which such analyses can be 
undertaken (see Refs 5-6). In many 
Instances, the methods used to char- 
acterize proteins present on a 2-D gel 
pattern are dependent on the exist- 
ence of DNA sequence data - genome 
and proteome analysis are therefore 
(intimately) linked. The continued 
integration of proteome and genome 
data will have a fundamental impact 
on our understanding of the normal 
functioning of cells and organisms. 



Reducing redundancy - the need 
to analyse protein expression 

DNA sequence data alone cannot be 
used to build, a 'molecular definition' 
of a cell. The DNA sequence data them- 
selves reveal little or nothing about the 
level of expression of proteins, the pro- 
tein Uo forms that may be produced 
from each gene (by alternative splic- 
ing) or the extent to which protein(s) 
are posttranslationalty modified. Nor 
can they provide Information on 
the cellular or subcellular distribution 
of proteins. It is precisely this type of 
information that proteome analysis 
will provide. 

One of the immediate impacts of 
genome sequence data is the ability to 
undertake mass screening of mRNA 
expression, and to this end a number 
of impressive approaches are being 
developed. Current progress includes 
the application of cDNA microarrays 9 
and serial analysis of gene expression 10 . 
These should provide information on 
which mRNAs are expressed and there- 
fore which proteins are likely to be 
expressed. If sufficient selectivity can 
be achieved, and recent reports sug- 
gest that it can 11 , they should also pro- 
vide important information on the pres- 
ence of mutated forms of genes and 
so could be particularly useful for diag- 
nostic screening. Strategies are also 
being developed to establish the bio- 
logical function of unknown genes 12 . 
However, it is clear that even with 
these approaches it is necessary to 
undertake comprehensive analysis of 
protein expression. Fortunately, the 
past two or three years have seen 
important developments In a number 
of techniques now enabling this to 
be undertaken. 

There are two (concurrent) phases to 
proteome analysis. First, all the pro- 
teins expressed by a particular organ- 
ism, tissue or cell under 'normal' con- 
ditions are Identified and their position . 
on 2-D gels mapped - this defines 
the 'constitutive' proteome. The infor- 
mation can be used to generate a 2-D 
reference map and database for the 
organism, tissue or cell examined. A 
number of 2-D reference maps and 
databases (see Fig. 1) are being de- 
veloped and are accessible through 
the World Wide Web and Internet 
(Table 2). In some cases, these data- 
bases are already Impressive; thus, for 
example, of the estimated 3500 or so 
proteins expressed in human keratino- 
cytes, approximately 30% (1 082) have 
been identified and are recorded in 
the 2-D reference database for this 
cell 11 . Similarly, 630 protein spots 
present on 2-D gels of human plasma 
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TABLE 1 - PttOCRESS OF SELECTED GENOME AND PROTEOSE PROiECTs> 




Organism 


Genome 
size (W)b) 


Estimated proteome 
size (gene number) 


Protein entries 
In SWISS PROT 


Proteins annotated 
on 2 D maps 


• 


Mycoplasma genitafium b 
Haemophilus influenzae** 
Escherichia co// b 
Methanococcus jannaschii* 
Saccharomyces cerevisiae* 
Dktyostelium discoideum 
Arabidopsis tholiana 


0.58 
1.83 
4.72 
1.66 

12.5 

70 

70 


470 
1743 
4000 
1738 
6000 
12 500 
14 000 


344 
1578 
3654 
62 
3720 
217 
541 


>S0 
> 1 uu 
>300 

>250 
>50 




Caenorhabdttis efegans 
Homo sapiens 


80 
2900 


17 800 
60 000-80 000 


1030 
3719 


50 
>1500 





genome-sequendng projects are under way or completed. enS ' 0na ' referenCe ma P s for J °™ organisms for which 
b Genome sequences that have been completed. Note that there i; » Ian f ™« . w 

as all submissions are checked thorough and ^Mp^^^l^'^^^^^^^"^ 
modifications wi.1 increase the tota. number of i^dSZ^T^^, " ° n0teworth * Posttrans.ationa. 
1 2. to , „o,d increase over gene number, perhaps ^^1^^^^^^ *" ^ * 3 





have been identified (Table 2, SWISS 
2D PACE). The achievements of these 
projects support the feasibility of pro- 
teome analysis. 

The 2-D reference maps will serve as 
a foundation on which the changes 
in pr teome expression will be ana- 
lysed. In the longer term, it is antici- 
pated that all this information will be 
readily accessed and integrated with 
the other databases. More tangibly, 
the 2-D reference maps and databases 
should enable other groups to identify 
proteins on 2-D gels by computer- 
driven comparison of an image of their 
2-D gels with those in the databases 
(see Fig. 1 ). For example, those study- 
ing t xic responses of the liver to 
drugs' 4 may exploit a liver 2-D refer- 
ence map (Table 2) to identify the pro- 
teins that change in response to indi- 
vidual treatments. To some extent, 
this may ultimately avoid unnecessary 
duplication of the effort required to 
characterize individual protein spots 
on 2-D gels. 

In its 'second phase', proteome 
analysis involves the measurement of 
changes in the proteome that occur in 
response to changing physiological 
conditions. This includes analysis of 
the quantitative changes in protein ex- 
pression, changes in posttranslational 
modifications and, where appropriate, 
chang s in subcellular localization of 
proteins. At least initially, this Is likely 
to be focused on groups or clusters 
of proteins rather than the entire 
proteome. It may be that much f this 
analysis will rely on existing antibody- 
based methods and therefore not be 
dependent on 2-D separati n of 
proteins. 

trends in CELL BIOLOCY (Vol. 7) April 1997 



Using 2-B> electrophoresis (2-DE) 
Ho affiaffys© protein expression - 
the povtKBF and problems off 

Much of proteome analysis is depend- 
ent on the ability to separate complex 
protein mixtures rapidly, and 2-DE is 
undoubtedly the most powerful 
method currently available to achieve 
this. Estimates of the number of poly- 
peptides that can be resolved on a 
single gel vary between 2500-1 0 000 
(see Ref. 8). Moreover, 2-DE, combined 
with appropriate computer analysis 
of gel images 15 , enables quantitative 
analysis of the amount of individual 
proteins within such mixtures. The 
method also potentially allows simul- 
taneous purification of thousands of 
proteins, unlike conventional protein- 
purification strategies, which are focused 
on the production of one or a few indi- 
vidual proteins. Moreover, the 2-DE- 
based approach allows proteins to be 
purified without any prior knowledge 
of their properties. 

One of the biggest differences be- 
tween conventional and 2-D protein 
purification is the amount of protein 
that is purified and therefore available 
for analysis. In the past, the low load- 
ing capacity of 2-D gels has severely 
limited its application as a purification 
step. Even with this limitation, signifi- 
cant characterization of proteins from 
2-D gels has been achieved with 
methods such as immuno blotting, 
N-terminal and internal peptide se- 
quencing, comigrati n f unknown 
pr teins with known proteins, or over- 
expression of homologous genes of 
interest in the organism under study 
(see Ref. 1 3). However, it is now poss- 
ible to apply milligram quantities of 



protein mixtures onto immobilized pH 
gradient (IPC) isoelectric-focusing (IEF) 
gels, so producing micropreparative 
2-DE and enabling the purification of 
microgram amounts of 200-300 pro- 
teins 15 . The more recent use of narrow- 
range IPC gels (1 .0 pH units) allows up 
to 1 5 rng of protein to be applied to a 
single gel; thus, using a series of these 
gels (of different pH ranges) could 
allow the purification of as many as 
3000 proteins at 5 u.g per protein 16 . 
These improvements have extended 
the applicability of 2-DE for the prep- 
aration of proteins for analysts. How- 
ever, it should be noted that some pro- 
teins are not readily solubilized by 
existing IEF buffers and that some high 
(>200 kDa) and low (<1 0 kDa) molecu- 
lar mass proteins are not adequately 
resolved by 2-DE. Despite these limi- 
tations, it seems evident that 2-DE can 
be used to undertake comprehensive 
protein analysis, and recent develop- 
ments have accelerated efforts to pro- 
duce new improved methods. 

Clearly, the potential success of iden- 
tifying proteins by comparison of their 
positions on a 2-D gel with locations 
on a 2-D reference map rests on the 
consistency and quality of the 2-D 
separations. In the past, this has posed 
problems, but current protocols based 
on IPCs provide spatially and quanti- 
tatively reproducible 2-D patterns 
both within a laboratory and, more 
importantly, betwe n laboratories 1718 . 
Despite these significant improve- 
ments, the r utine production f high- 
quality gels on the scale required 
for comprehensive documentation of 
proteome changes still requires exten- 
sive practical expertise. This potentially 
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Some features of a two-dimensional database - the HSC 2DPArP r 7~- 77 " FtCURE 1 

NHU hnprtd CCege Schoo, of M-ch. ^HareMd Hoj^taf UK^^^^ 
results in a list of the databases containing an entry for thi 'term T ft* T^rln I . 1 C ^ te " n ' trioie I****** isomerase' (or 'tnoseT 

spot found in the human ventnCe. (!) This Sedt he^rSein £j i" J^SlLS T * ^ " °' f " * e P"** 

accessed direct* by -clicking- on , protein spot , n the reference 0 £3ge (f> t^S^I^^T? (2 ' D> 9 " ir ™ 9e {<0 Datab " e ent "" eS ** 

Database (g), Medline, SW! 5 5- 2D PACE, OMIM and PROSITE. 



limits the applicability of 2-OE. A fully 
automated 2-0 'machine' would be 
very useful, and the development of 
such Instrumentation is under way in 
several laboratories (see, for example, 
Ref. 1 9). The now obvious potential 
of proteome analysis will undoubtedly 
result in others attempting to develop 
automated instrumentation. Alterna- 
tively, the use of precast gels with 
both first-dimension IPC and second- 
dimension SDS-polyacrylamide gels 
m an alf-in-one unit, requiring just a 
swapping of electrode positions to run 
the two dimensions, might provide 
highly reproducible separations that 
would be technically less demanding 
and perhaps more readily applicable 
to automation. 
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Recent breakthroughs in protein 
Identification and 
characterization 

Unequivocal identification of a pro- 
tein is most convincingly achieved by 
direct determination of its primary 
sequence, and until recently protein 
identification relied almost entirely on 
Edman degradation. With the increase 
in ONA and protein sequence infor- 
mation (Table 1), now only limited 
N-terminal or internal sequence, typi- 
cally 1 0-20 amino acids, may be suffi- 
cient to identify many proteins. Even 
so, sequencing by Edman degradation 
is limited by a relatively low sample 
throughput of usually one or two pep- 
tides per day. Increasingly, groups are 
now using Identification strategies 



that involve the combination of many 
pieces of quickly, economically and 
more easily obtained data to identify 
proteins with confidence (see Refs 5-8 
and 15). This has led to the develop- 
ment of integrated approaches using 
combinations of amino acid composi- 
tion, peptide mass fingerprinting, 
Edman sequencing and tandem mass 
spectrometry (Table 3). These strat- 
egies also often use the estimated mol- 
ecular weight and pi of the protein. 
So, for example, data for the amino 
acid composition or the masses (rapidly 
measured by mass spectrometry) of 
peptides generated by enzymatic deav- 
age of the protein can be combined 
with the molecular mass and pi of a 
protein spot to 'search' the SW? SS-PROT 
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TABLE^Q^^^ AND ^, EW DATABA SES OF REL^tq P^qmITnaLYSK. 



Database 



Description 



Address 



2 D gel databases 



Biobase 
EC02 DBASE 
Heart-2DPACE 
HSC-2DPACE 



LSB- 



QUEST 



SWISS-2DPACE 



Yeast 



Yeast 



Keratinocytes, bladder cancer, etc. 
(Danish Centre for C nome Research) 

E . coti 

(in NCBI repository) 
Human myocardial 

(Berlin, Cermany) 
Human heart 

(Heart Science Centre, Harefield, UK) 

— Human f -mouse and rat liver,_etc 

(Large Scale Biology, USA) 
Ref52 cells, mouse embryo, yeast 
(Cold Spring Harbor Lab., USA) 
10 human maps including: liver, plasma, 
erythrocytes, platelets. Also, yeast, £. coli 
(University Hospital, Geneva, Switzerland) 
5. cerevisiae, S. pambe, etc. 
(Cbteborg University, Sweden) 
5. cerevisiae 
(Proteome Inc., USA) 



Protein sequence databases 



PDD 
PIR 

ProDom 
PROSITE 
SWISS-PROT 
YPD 



Protein Disease Database 
Protein Identification Resource 
- protein sequences 
Protein Oomains and families 
Dictionary of Protein Sites and patterns 
SWISS-PROT annotated protein sequences 
Yeast Protein Database 



3-D protein structure databases 

PDB Brookhaven Protein Data Bank 

SC0P Structural Classification Of Proteins 

SWISS-3DIMACE 3-D images of proteins (and other molecules) 

Cen me and DNA databases 

EM8L EMBL-EBI nucleotide sequences 

CDB Genome database 

CenBank GenBank nucleotide sequences 

CSDB Genomic sequences 

Useful Indexes and lists of links to protein related sites 

Amos's links 
Bio-wURLd 



EBI 

ENTRE2 
ExPASy 
Pedro's 

homepage 
SIR 



Amos Bairoch's WWW links page 
Searchable collection of molecular 
biology URLs 
EBI homepage 

Expert Protein Analysis System 
BioMolecular Research Tools 

Structural Information Resources; list 
of URLs on macro molecular structure 



ht tp : / /biobase . dk/cgi - bin/cel i s 

f tp : / /ncbi . nlm . run . gov/reposi tory/EC02 DBASE 

http://www.chenue.eu-berlin.de/user/pleiss/ 

http : / /www. harefield . nthames .nhs.uk/ 
nhl i /protein/ index . hanl . 
^http:./V,lsbc,AOT/^^ps/patterns .html 

http: //siva . cshl . org/ 

ht tp : / /expasy . hcuge ch/ch2d/ch2d- top . html 

http: / /yeas t - 2dpage . gmn . gu . se 
ht tp : / /www . proteome . com/ YPDhome . html 



http : / /www-pdd . nc if erf . gov/ 
ht tp : / /www . gdb . org/ Dan/proteins /pir . html 

http : //www . Sanger . ac . uk/~esr/prodom . html 
http : //expasy - hcuge . ch/sprot/prosite . html 
http :/ /expasy . hcuge . ch/sprot /sprot- top . html 
http : / /www . proteome . com 



http://www.pdb.bnl.g-ov/ 

ht tp : / / scop . mrc- lmbcam .ac.uk/ scop 

http://expasy.hcuge.ch/sw3d/sw3d-top.html 

http://www.ebi .ac.uJc 

http: //gdhwww.gdb.orc// 

http: / /gdk^ww . gdb . org 

http /www. ncgr . org /gsdb/gsdb. html 



http : / /www/amos_www_l inks . html 
http: //ebi -ac.uk/htbLn/wurld.pr 

http://www.ebi.ac.uk 

http:/ / www3 . ncbi . nlm . nih . gov/En tr e z 

ht tp : / /expasy . hcuge . ch 

http://www.public.iastate.edu/-pedro/ 
research_tools . html 

http : / /www. ucmb. ulb. ac . be /StxctRe sources 



database and generate a list of best- 
matching proteins. The probability of 
achieving a correct identification is in- 
creased dramatically by the combination 
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of two or more approaches to gener- 
ate the data used to search the se- 
quence databases (Table 3). It might 
be assumed that pr tein identification 



by homology searching would n t be 
applicable to organisms for which there 
is little DNA (or protein) sequence infor- 
mation; however, recent attempts at 
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TABLE J - INTEGRA 
TWO Dl 



NALYSIS FOR MASS SCREENING OF 
'ONAL CEL SEPARATED PROTEINS* 



Potential order 
of analysis 



Identification technique 



Refs 



Amino acid composition 22, 30, 3 1 

Amino acid composition and short N-terminat 23 

Edman microsequencing 

Peptide mass fingerprinting 31-33 

Combination of amino acid composition and 20,21 

peptide mass fingerprinting 

Mass spectrometry sequence tag 24 

Extensive N-terminal Edman microsequencing 34 

Internal peptide Edman microsequencing 35 

Microsequencing by mass spectrometry 25,26 

(electrospray ionization, post-source decay 

MALDI-TOF) 



•Rapid and inexpensive techniques are used as a first step in protein identification, 
and slower more expensive techniques then used, if required. The potential order of 
analysis will change with advances in technology (particularly in mass spectrometry). 



cross- species mapping have proved suc- 
cessful, albeit on a limited scale 20 . The 
results of these integrated approaches, 
which can, be undertaken with as little 
as 250 ng of protein, are dramatic both 
in terms of the success rate of positive 
identifications and the potential through- 
put of samples (see Ref. 22). The meth- 
ods also offer the potential to search 
the expressed-sequence tag (EST) data- 
bases such that a limited amount of 
rapidly obtained data (i.e. sequence 
tag) from a protein spot of interest on 
a 2-D gel could be quickly 'converted' 
into a large amount of sequence infor- 
mation for the corresponding (previ- 
ously uncharacterized) gene. However, 
while rapid and relatively straightfor- 
ward, these approaches do not always 
provide unambiguous identification 
of a protein and are of course depend- 
ent on appropriate sequences being 
present in sequence databases. There 
remains, therefore, an important place 
for extensive protein sequencing by 
Edman degradation or more recently 
developed mass-spectrometry tech- 
niques. The most exciting of these has 
been the use of nano-electrospray 
tandem mass spectrometry to se- 
quence a 2-D-separated and silver- 
stained protein (see article in this issue 
by A. Lamond and M. Mann, and 
Ref. 26). It is likely that the sensitivity 
and speed of protein characterization 
from 2-D gels will increase further and 
that the cost of each protein spot char- 
acterization will decline. Clearly, this 
will be necessary if the products of the 
60 000-80 000 human genes, includ- 
ing all posttranslationally modified 
forms, are to be included in protein 
databases in the foreseeable future. 



Integration of protein sequence 
data with genome sequence 
databases 

The effective use of DNA sequence 
information is facilitated by well- 
maintained arid readily accessible data- 
bases; the same will be true of proteome 
data. A number of 2-D databases have 
been established (see Table 2). In these 
databases, the protein spots on images 
of 2-D gels can be used to identify the 
protein (if its identity is known) and 
to provide a direct link to its protein 
and DNA sequence data within the 
sequence databases and to structural 
information (see Fig. 1 ). Access to the 
2-D databases may also be achieved 
from the sequence databases. As 2-D 
databases grow in numbers of entries, 
so too will the application of the infor- 
mation and their integration with 
other databases, including those that 
document protein changes in disease 
states (Table 2). 

Applications of proteome analysis 

The potential applications of pro- 
teome analysis are numerous. At its 
most straightforward, proteome data 
will be used to confirm open-reading 
frames in DNA sequences. It may 
also provide information on possible 
alternatively spliced forms of mRNAs. 
More importantly, by comprehensively 
monitoring changes in protein abun- 
dance, it may be possible to assign a 
function to novel gene products by 
measuring changes in the proteome 
after overexpression or knockout of the 
gene of interest. Even if the function of 
a gene is known, or suspected, analysis 
of the effect its deletion on tissue/cell 
protein expression will prove valuable, 



particularly for the analysis of trans- 
genics where redundancy in protein 
function may result in little apparent ef- 
fect on phenotype. Alternatively, analy- 
sis of changes in protein expression 
that accompany large changes in 
phenotype of the transgenic may be 
very revealing. 

More obvious are the applications of 
proteome analysis to measurement of 
changes in protein. expression during 
development, through the cell cycle, 
during apoptosis, in diseases and in 
response to extracellular stimuli (hor- 
mones, cytokines, stress), drug treat- 
ment and the application of toxic 
agents. The ability to follow changes 
in the expression of all proteins in a tis- 
sue or cell type rather than orie, two or 
perhaps several as is commonly under- 
taken at present will dramatically in- 
crease our understanding of each of 
these processes. Similarly, analysis of the 
differences in the prdteomes of patho- 
genic versus nonpathogenic bacterial 
strains should prove valuable. Of course, 
proteome analysis will also be used to 
characterize all the posttransiationa! 
modifications, including phosphoryl- 
ation, fatty acylation, acetylation, ADP- 
ribosylation and glycosylation, that 
occur and can change in each of the 
circumstances detailed above. 

It is becoming increasingly apparent 
that the transmission of information 
within cells and many other diverse 
subcellular activities (vesicle transport, 
transcription, mRNA processing, tran- 
sport across the nuclear membrane) 
involve complex and dynamically 
changing protein assemblies. Proteome 
analysis will assist in identifying the 
components of complex protein assem- 
blies either by direct analysis of the 
protein complexes or, for example, by 
analysis of immunoprecipitates of one 
of the protein components of a com- 
plex. By revealing which proteins are 
expressed within a particular cell, it 
will also assist in interpreting whether 
potential combinations of protein inter- 
actions identified by a two-hybrid 
screen actually occur in individual cells. 
In this way, 2-DE and proteome analy- 
sis should play a vital role in the gener- 
ation of cell-specific protein linkage 
maps (see Ref. 27). Furthermore, with 
appropriate prefractionation of samples, 
it will also be possible to analyse the 
subcellular localization and potential 
changes in subcellular localization of 
proteins. This will be particularty useful 
for comprehensive analysis of changes 
in the localization of intracellular sig- 
nalling proteins (see Refs 28 and 29). 
Many of the potential applications 
of proteome analysis were discussed 
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recently at a meeting ('From Genome 
to Proteome') held in Siena. Italy 
(16-18 September .1996) and are 
reported in a special issue of Electro- 
phoresis published in April 1 997. 



Conclusions and future prospects 

It is evident that recent years have 
seen such dramatic and exciting ad- 
vances in 2-DE and methods to char- 
acterize gel-separated proteins that 

the aim of screenin qjQmprphp n ^ Vf ,fy 

the proteins expressed in an organism' 
a tissue or particular cell type is now 
becoming an achievable goal. How- 
ever, a number of technical challenges 
will have to be overcome before pro- 
teome analysis realizes its fuH potential. 
Importantly, in the ma/ority of current 
applications, the 2 0 gels are reveal- 
ing only proteins of high- to-moderate 
abundance, and technical improve- 
ments will be needed to increase the 
sensitivity of protein detection. The 
recent surge of interest and activity in 
the field of proteome analysis greatly 
increases the likelihood that these 
developments will be made. 

In the same way that the increase in 
ONA sequence data raised a number 
f important issues, so too will the 
future expansion in protein data. These 
issues include ensuring the accuracy of 
pr tein characterizations and the valid- 
ity of reported changes in protein 
expression; safeguarding access to 
data; and collaborative management 
and integration of projects to avoid 
unnecessary duplication. Much has 
been learnt from the genome projects, 
and we hope that this article will enl 
nance awareness of proteome-analysis 
projects and assist in the formation of 



a proteome-analysis network to plan, 
manage and integrate existing and 
future proteome projects. 
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Branching out in 
neurobiology 

Nerve Growth and Guidance 
(Frontiers in Neurobiology) 



edited by C D. McCoig, 
Portland Press, J 996. £60.00 
(1 74 pages) ISBN 1 85S78 08S 2 

This book is a concise review f r those 
interested in recent advances in the field 
of neurobiology and makes excellent 
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supplementary reading for senior col- 
lege and graduate students. 

The integrity of the nervous system 
depends upon highly specific connec- 
tions formed between neurons during 
development and regeneration. The 
specificity of these connections requires 
axonal outgrowth along defined path- 
ways. Since the early morphological 
studies of Ramon y Cajal and Harrison, 
the control of events such as axonal 
guidance and target recognition have 
been attributed to the growth cone, a 
specialized structure located at the dis- 
tal tip of growing neurites (Chapter 1). 

Crowth cones are capable of respond- 
ing to various environmental signals. 
Signals such as soluble trophic or tropic 
fact rs and other molecular cues resid- 
ing in the extracellular matrix and cell 



membranes have long been shown to 
be involved in neurite outgrowth and 
guidance (Chapters 1, 6 and 8). Snow 
et al. remind us of another potential 
player, the proteoglycans, that has 
been increasingly implicated in the 
development and maintenance of the 
nervous system (Chapter 6). One re- 
maining puzzle, however, is the ques- 
tion of how the growth-promotion 
molecules - dozens of them identified 
so far - are able to govern the thousands 
of specific connections made among 
different neurons. Recentry, a variety of 
'collapsing' factors were added to this 
fascinating repertoire f regulators, giv- 
ing a Yin-Yang balance to what we know 
about axonal navigation and guidance. 
Bandlow in Chapter 7 and Shewan 
and Cohen in Chapter 8 summarize 
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